Phonetic Dictionaries, Speech Synthesis, Linguistic Resources, Audio Processing
MuFFIN: Multifaceted Pronunciation Feedback Model with Interactive Hierarchical Neural Modeling
arxiv.orgยท4h
Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba
arxiv.orgยท4h
SpeechCT-CLIP: Distilling Text-Image Knowledge to Speech for Voice-Native Multimodal CT Analysis
arxiv.orgยท1d
Mouse Sensors Can Pick Up Speech From Surface Vibrations, Researchers Show
it.slashdot.orgยท1d
Loading...Loading more...